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DETERMINATION  OF  LAND  USE  FROM  LANDSAT  IMAGERY: 
APPLICATIONS  TO  HYDROLOGIC  MODELING 


The  Hydrologic  Engineering  Center 
U.S.  Army  Corps  of  Engineers 
Davis,  California 


INTRODUCTION 

The  purpose  of  this  report  is  to  document  work  completed  over  the  past  year 
at  the  Hydrologic  Engineering  Center  (HEC)  in  the  area  of  "remote  sensing 
and  land  use."  This  work  has  been  performed  in  a  cooperative  effort  with  the 
University  of  California,  Davis  (UCD)  and  the  National  Aeronautics  and  Space 
Administration  (NASA)  as  part  of  a  NASA  Water  Management  and  control  ASVT 
(Application  Systems  Verification  and  Transfer).  The  project  participants 
are  listed  in  Appendix  A. 

One  of  HEC's  mission  objectives  is  to  take  new  and  promising  technology  that 
is  applicable  to  hydrologic  engineering  and  water  resources  planning,  test 
it  in  a  "real  world"  application,  and  (if  results  are  satisfactory)  make  it 
conveniently  available  to  Corps  of  Engineers'  field  offices.  The  use  of 
remote  sensing,  in  particular  the  LANDSAT  series  of  satellites,  to  determine 
land  use  for  input  to  hydrologic  models  is  an  example  of  the  technology 
transfer  from  the  research  community  to  the  working  environment  of  the  Corps 
of  Engineers . 

The  various  components  of  the  NASA  ASVT  project  have  been  consistent  with 
this  philosophy  of  providing  a  useable  product.  UCD  developed  an  operational 
procedure  for  land  use  classification  using  LANDSAT  digital  data  and  tested 
it  on  four  watersheds.  HEC  has  tested  the  procedure  on  two  additional 
watersheds,  has  compared  LANDSAT  and  conventional  land  use  in  four  basins, 
and  has  determined  the  hydrologic  significance  of  the  difference  between 
LANDSAT  and  conventional  land  use  classifications  for  three  of  these  basins 
(Cermak,  1979).  HEC  has  also  contracted  with  a  commercial  organization  to 
classify  one  of  the  same  watersheds  that  has  previously  been  analyzed  using 
the  UCD  procedure,  thus  permitting  an  additional  comparison  of  output 
product  quality  and  cost.  NASA  has  engaged  a  consultant  to  conduct  a  formal 
cost-effectiveness  study  of  LANDSAT  land  use  classification,  to  which  HEC 
has  provided  cost  and  labor  information. 

This  report  will  begin  with  an  explanation  of  the  relationship  between 
hydrologic  modeling,  land  use,  and  LANDSAT.  Next,  the  use  of  grid  cell  data 
banks  and  HEC's  spatial  data  management  system  for  comparing  alternative 
land  use  classifications  are  described.  A  general  discussion  of  LANDSAT  and 
conventional  land  use  comparisons  will  precede  a  detailed  land  use  analysis 
of  five  study  basins.  HEC's  experience  with  the  commercial  classification  of 
some  of  these  same  basins  will  be  reported,  followed  by  a  tabulation  of 
relevant  cost  data.  Finally,  hydrologic  simulations  of  three  watersheds, 
based  on  both  LANDSAT  and  conventional  land  use,  will  be  described.  From 
this  analysis,  an  evaluation  will  be  given  of  the  use  of  LANDSAT  land  use  in 
the  hydrologic  modeling  process. 
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HYDROLOGY,  LAND  USE,  AND  LANDSAT 


The  hydrologic  modeling  of  a  watershed,  particularly  urban  or  urbanizing 
basins,  requires  that  the  distribution  of  land  use  be  determined.  The  amount 
and  timing  of  runoff  is  directly  related  to  the  infiltration  capacity  of  a 
land  area  with  the  most  important  distinction  being  between  pervious  and 
impervious  land  surfaces.  Water  quality  parameters  have  a  similar  dependence 
on  land  use  data;  rate  of  accumulation  of  a  particular  pollutant  per  unit 
area  is  normally  expressed  as  a  function  of  land  use.  Water  resource 
planning  studies  are  interested  in  not  only  an  assessment  of  the  present 
state  of  the  water  and  related  resource  system,  but  also  its  possible  future 
configuration.  By  expressing  hydrologic  parameters  as  a  function  of  current 
land  use  it  becomes  possible  to  rationally  predict  the  impact  future  land 
use  changes  will  have  on  the  quantity  and  quality  of  future  runoff. 

Manual  methods  for  land  use  identification  (e.g.,  interpretation  of  low 
altitude  aerial  photography  and  field  surveys)  are  frequently  used  in 
watershed  studies.  With  this  approach,  the  resource  requirements,  both  money 
and  labor,  for  manual  classification  can  be  extensive.  An  attractive 
alternative  is  the  utilization  of  available  remote  sensing  systems  and 
computer-assisted  classification  techniques. 

The  LANDSAT  satellites  have  been  shown  to  have  the  capability  of  providing 
land  use  data  at  acceptable  levels  of  accuracy  for  hydrologic  modeling 
purposes  (Algazi,  1977b;  Burgy,  1974;  Jackson,  1977;  Ragan,  1975).  LANDSAT 
data  is  quicker  and  less  costly  to  obtain  and  interpret  than  low  altitude 
aerial  photography,  provides  repetitive  coverage  of  the  same  area  at  least 
every  18  days,  and  is  available  for  all  United  States  and  many  worldwide 
locations.  Additionally,  LANDSAT' s  digital  format  can  be  directly  analyzed 
by  several  available  classification  computer  programs,  and  can  be  resampled 
for  automatic  inclusion  in  a  geographic  data  bank. 

Different  physical  features  on  the  earth's  surface  reflect  different  amounts 
of  light  in  certain  wavelength  intervals.  This  is  the  basic  principal  upon 
which  LANDSAT 1  operates.  By  identifying  a  spectral  pattern  or  "signature" 
that  is  consistently  associated  with  a  particular  land  use,  it  is  possible 
to  classify  major  land  cover  categories  over  large  areas  directly  from  the 
LANDSAT  data.  "For  example,  vegetation  typically  reflects  more  green  light 
than  red  and  is  more  reflective  (bright)  in  the  infrared.  Many  dry  soils,  by 
contrast,  reflect  less  light  in  the  green  than  in  the  red  and  moderately 
more  so  in  the  infrared;  wet  soils  show  similar  patterns  of  relative 
reflectance  in  the  four  bands  but  the  magnitude  or  intensity  of  light 
reflected  in  each  band  is  reduced  by  the  general  light-absorbing  character 
of  water"  (Short,  1976). 


^See  Appendix  B  for  background  information  on  LANDSAT's  orbital 
characteristics,  sensor  system,  and  data  formats. 


GRID  CELL  SPATIAL  DATA  MANAGEMENT  SYSTEM 


The  Corps  of  Engineers'  Expanded  Flood  Plain  Information  Studies  (XFPI) 
provided  a  mechanism  for  evaluating  LANDSAT  land  use  classifications.  The 
XFPI  pilot  studies  undertaken  by  several  Corps  district  offices  analyzed  the 
hydrologic,  economic,  and  environmental  aspects  of  existing  and  alternative 
future  land  use  patterns.  The  analyses  included  the  automatic  computation  of 
expected  annual  flood  damages  which  resulted  from  changed  hydrologic 
responses  and/or  stage-damage  functions  as  related  to  each  land  use  pattern 
and  flood  management  measure.  Example  XFPI  studies  are  described  in  U.S. 

Army  Corps  of  Engineers  (1975;  1978c). 

A  comprehensive  spatial  data  management  system  developed  at  the  HEC  (Davis, 
1978)  was  used  in  the  XFPI  studies  for  the  following  purposes:  (1)  to 
encode,  check  and  place  geographic  map  data  in  a  grid  cell  data  bank;  (2)  to 
display  the  data  through  plotting  maps  of  one  or  more  variables  by  their 
absolute  values  or  by  weighted  combinations  of  relative  attractiveness;  and 
(3)  to  extract  data  from  the  data  bank  and  to  formulate  parameters  for 
hydrologic,  economic  and  environmental  simulation  programs.  The  principal 
computer  programs  for  data  bank  input,  hydrologic  parameter  identification 
and  watershed  simulation,  and  their  relation  to  the  UCD  LANDSAT  land  use 
classification  programs  are  shown  in  Figure  1. 

Each  XFPI  grid  cell  data  bank  contained  an  existing  land  use  variable  which 
was  classified  by  conventional  means.  This  allowed  an  exhaustive  cell-by¬ 
cell  comparison  between  the  conventionally  classified  land  use  and  the 
corresponding  LANDSAT  land  use  classification  using  the  computer  program  RIA 
(U.S.  Army  Corps  of  Engineers,  1978b).  Hydrologic  parameters,  expressed  as  a 
function  of  several  spatial  variables  including  land  use,  were  calculated 
for  both  the  conventional  and  the  LANDSAT  land  use  classifications  by  using 
the  computer  program  HYDPAR  (U.S.  Army  Corps  of  Engineers,  1978d).  Using  the 
HEC-1  watershed  model  (U.S.  Army  Corps  of  Engineers,  1973),  which  had  been 
calibrated  to  each  of  the  individual  basins  by  the  responsible  Corps 
district  office,  the  significance  of  the  hydrologic  differences  between  the 
two  land  use  classifications  was  assessed. 

UCD  PROCEDURE 

An  operational  procedure  for  land  use  classification  from  LANDSAT  data  has 
been  developed  at  the  University  of  California,  Davis  (UCD)  for  use  by  the 
Corps  of  Engineers.  Referred  to  as  the  UCD  Procedure,  it  was  designed  to 
function  under  the  following  constraints: 

(1)  Use  of  dedicated,  interactive  image  processing  facilities  would 
not  be  required.  Only  output  equipment  normally  available  in  Corps'  field 
offices  (e.g.,  line  printer)  and  batch-mode  access  to  a  general  purpose 
computer  could  be  expected. 

(2)  Would  not  require  additional  software  beyond  that  provided  as  part 
of  the  procedural  package. 

(3)  Would  not  require  specialized  technical  expertise  in  data 
analysis,  computer  programming,  or  remote  sensing. 
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Figure  1  SCHEMATIC  OF  PROCESSING  SYSTEM 


(4)  Would  provide  a  usable  product;  i.e.,  one  that  can  conveniently  be 
entered  into  a  grid  cell  data  bank  and  that  will  adequately,  from  a 
hydrologic  viewpoint,  represent  current  land  use  conditions. 

The  UCD  procedure  consists  of  an  organized  set  of  computer  programs  and 
manual  operations  for  the  identification  of  land  use  from  raw  LANDSAT  data. 

A  detailed  description  of  the  procedure  is  given  by  Algazi  (1979)  and  Meyer 
(1978).  What  follows  is  a  brief  outline  of  the  primary  tasks: 

(1)  Obtain  LANDSAT  Computer-Compatible  Tapes  (CCT),  NASA  high  altitude 
aerial  photography,  and  USGS  topographic  maps  for  the  location  and  date  of 
interest.  Extract  a  rectangular  area  of  data  containing  the  watershed  from 
the  CCT.  Check  for  radiometric  errors  in  the  LANDSAT  digital  data  and,  if 
necessary,  correct. 

(2)  Determine  the  geometric  registration  of  the  LANDSAT  image  with  the 
coordinate  system  of  the  topographic  maps.  LANDSAT  control  points  are 
identified  from  the  output  of  a  UCD  computer  program  which  enhances  roads 
and  water  bodies  found  in  the  LANDSAT  image.  A  regression  equation, 
estimated  from  the  two  sets  of  control  points,  provides  a  transformation 
mechanism  for  going  between  the  image  coordinate  system  and  the  map 
coordinate  system,  Universal  Transverse  Mercator  (UTM) . 

(3)  Use  an  unsupervised  clustering  algorithm^  to  partition  the 
LANDSAT  four-dimensional  data  space.  Groups  or  "clusters"  are  identified 
that  contain  points  with  spectral  reflectance  values  that  are  similar  to 
members  of  the  same  cluster,  and  dissimilar  to  the  points  of  other  clusters. 
The  clustering  program  is  allowed  to  generate  a  maximum  of  30  clusters.  Each 
pixel  in  the  watershed  data  is  assigned  to  a  cluster. 

(4)  Select  from  a  line  printer  map  of  the  cluster  assignments  six  sets 
of  adjacent  pixels  (spatial  groups),  all  belonging  to  the  same  cluster. 

Their  corresponding  location  on  the  topographic  maps  is  determined  using  the 
transformation  equation  of  step  (2).  Visual  translation,  from  the  map  to  the 
aerial  photographs,  of  the  spatial  group's  location  permits  a  land  use  to  be 
assigned  to  each  spatial  group.  For  clusters  having  a  consistent  land  use 
assigned  to  all  six  spatial  groups,  a  final  land  use  has  been  determined. 

But  for  those  clusters  where  conflicts  exist  between  the  land  use  identified 
with  each  of  the  six  spatial  groups,  further  partitioning  of  the  data  space 
is  required. 

(5)  Clusters  with  conflicting  land  use  assignments  and  clusters  whose 
associated  land  use  could  not  be  determined  from  the  available  maps  and 
photos  are  reclustered  by  repeating  step  (3),  and  given  final  land  use 
assignments  by  repeating  step  (4). 

(6)  At  this  point  the  watershed  data  file  contains  a  land  use 
classification  (typically  5  to  7  categories)  for  all  its  pixels.  The 
watershed  file  is  then  resampled  at  the  grid  cell  centroids  using  a 


^See  Appendix  C  for  a  desciption  of  ISOCLAS,  the  UCD  Procedure's 
clustering  algorithm. 
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nearest-neighbor  algorithm.  The  size  of  the  grid  cells  is  usually  line 
printer  compatible  with  the  scale  of  USGS  7-1/2-min.  topographic  maps. 


(7)  The  resampled  file  is  entered  directly  into  a  grid  cell  data  bank. 
Alternatively,  a  file  containing  the  digitized  (in  UTM  coordinates) 
watershed  boundary  can  be  used  to  mask  the  resampled  file,  leaving  only  the 
grid  cells  within  the  boundary.  Total  acreage  of  each  land  use  class  for  the 
entire  watershed  is  then  computed. 

LANDSAT  -  CONVENTIONAL  LAND  USE  COMPARISON 

The  UCD  Procedure  has  been  applied  by  the  UCD  staff  to  four  watersheds 
(Trail  Creek,  Castro  Valley,  Pennypack  Creek,  and  Rowlett  Creek)  and  by  the 
HEC  staff  to  two  watersheds  (Crow  Creek  and  Walnut  Creek).  Detailed 
comparisons  of  LANDSAT  and  conventional  land  use  classifications  for  these 
basins  will  be  presented. 1  First,  however,  it  is  necessary  to  discuss  what 
factors  influence  land  use  classification  accuracy,  what  is  being  compared, 
and  what  qualifications  should  be  kept  in  mind  before  interpretations  and 
conclusions  are  made  from  the  comparison  results. 

LANDSAT  Land  Use 


The  quality  of  a  LANDSAT  land  use  classification  is  a  function  of  several 
factors;  some  are  within  human  control  but  others  reflect  inherent 
limitations  of  the  sensor  system.  It  has  been  found  that  scale  and  date  of 
ground  truth  data  are  critical  variables.  If  the  scale  of  high  altitude 
aerial  photography  is  too  small  to  identify  land  features  or  if  the  date  of 
the  photography  is  quite  different  from  the  date  of  the  LANDSAT  imagery, 
then  it  becomes  very  difficult  to  make  accurate  land  use  assignments  to  the 
cluster  spatial  groups.  Personal  knowledge  of  a  study  basin  and  experience 
and  skill  of  the  analyst  are  also  important  when  it  comes  to  interpretation 
of  aerial  photography. 

LANDSAT  itself  is  limited  by  its  maximum  resolution  to  areas  no  smaller  than 
a  pixel  (about  1.11  acres).  Spectral  reflectance  is  averaged  over  the  entire 
pixel,  providing  the  opportunity  for  two  very  different  physical  features 
(e.g.,  rooftops  and  lawns)  to  appear  spectrally  the  same  to  LANDSAT  as  some 
third  category  (e.g.,  agricultural  land).  Another  problem  is  that  land  use 
classes  that  we  would  like  to  discriminate  between  may  have  very  similar 
spectral  signatures  (e.g.,  high  density  residential  and  small  commercial 
buildings).  Potentially,  the  algorithm  used  to  identify  spectral  patterns  in 
the  LANDSAT  data  may  influence  the  quality  of  the  final  classification. 

Based  on  our  experience  with  different  classifiers,  however,  there  appears 
to  be  no  major  difference  in  the  accuracy  level  of  the  available 
classification  procedures. 


i LANDSAT  land  use  for  Trail  Creek  has  been  reported  elsewhere  (Algazi, 

1977a)  and  will  not  be  repeated  here.  Because  only  LANDSAT  land  use  has  been 
determined  for  Crow  Creek,  no  comparison  with  conventional  land  use  was 
possible . 
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Conventional  Land  Use 


The  existance  of  grid  cell  data  banks  was  the  criteria  for  selecting  which 
watersheds  would  be  subjected  to  a  land  use  comparison.  Each  data  bank 
(previously  created  by  either  a  Corps  District  or  HEC)  contained  an 
"existing"  land  use  variable  that  had  been  determined  by  conventional  means, 
usually  manual  interpretation  of  low-altitude  aerial  photography.  It  was  to 
this  classification,  referred  to  as  conventional  land  use,  that  the  LANDSAT 
land  use  was  compared.  Obviously  the  meaningfulness  of  the  comparison  would 
depend  on  the  degree  to  which  conventional  land  use  actually  represented 
"real  world"  conditions  on  the  date  the  LANDSAT  imagery  was  taken.  Among  the 
watersheds  examined,  variation  in  quality  of  the  conventional  land  use  was 
noticed.  For  some  basins,  low-altitude  photographic  coverage  was  flown  and 
the  resulting  photo  mosaic  interpreted  specifically  for  encoding  into  a  grid 
cell  data  bank.  But  for  others,  "existing"  land  use  was  determined  by 
encoding  land  use  boundaries  from  local  agency  land  use  planning  maps  and 
USGS  topographic  maps.  Land  use  planning  maps  will  often  show  what  "should 
be"  or  "will  be",  rather  than  what  "is".  Conventional  land  use  based  on  such 
data  may  be  of  questionable  accuracy. 

Land  Use  Comparison 

When  comparing  LANDSAT  and  conventional  land  use,  it  is  important  to 
recognize  that  the  same  land  cover  can  be  interpreted  differently;  i.e., 
conventional  land  use  categories  are  not  always  compatible  with  LANDSAT  land 
cover  categories.  For  example,  the  conventional  category  "transportation/ 
communication/utilites"  includes  major  highways,  right-of-way  for  railroads 
and  power  transmission  lines,  communication  towers,  airport  facilities 
(including  buildings,  runways,  and  vacant  land  within  the  airport  limits), 
and  sewage  treatment  plants.  In  contrast,  LANDSAT  will  recognize  the 
treatment  plant  settling  tanks  as  "water  bodies",  the  open  fields 
surrounding  a  runway  as  one  of  the  vegetation  categories,  and  right-of-ways 
as  whatever  land  cover  class  is  nearby. 

A  comparison  at  the  grid  cell  level  can  be  thought  of  as  a  comparison  at  the 
pixel  level;  both  units  of  area  are  nearly  the  same  size.l  At  this  scale 
errors  introduced  during  the  geometric  correction  and  resampling  steps  of 
the  UCD  Procedure  will  be  erroneously  interpreted  as  LANDSAT  misclassifi- 
cation  errors.  This  will  be  particularly  true  of  land  use  categories  defined 
by  a  small  number  of  adjacent  pixels.  The  border  areas  of  the  larger  land 
use  categories  are  also  susceptible  to  geometric  problems. 

For  hydrologic  modeling  purposes,  accuracy  at  the  grid  cell  level  is  not 
necessary.  Typically  the  smallest  unit  of  analysis  in  hydrologic  modeling  is 
the  subbasin;  Pennypack  Creek,  for  example,  was  modeled  (U.S.  Army  Corps  of 
Engineer,  1978a)  using  subbasins  ranging  in  size  from  0.17  sq.  mi.  (109  ac) 
to  4.03  sq.  mi.  (2579  ac).  In  determining  model  parameters,  land  use  for  the 


*The  pixel  is  approximately  1.11  acres;  grid  cells  for  Castro  Valley, 
Walnut  Creek,  Rowlett  Creek  and  Pennypack  Creek  are  1.53  ac,  1.15  ac,  1.15 
ac  and  0.74  ac,  respectively. 
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individual  grid  cells  is  aggregated  for  all  cells  within  a  subbasin 
boundary.  Further  aggregation  of  subbasin  land  use  gives  the  distribution  of 
land  use  over  the  entire  basin.  It  is  at  the  grid  cell  level  and  watershed 
level  that  comparisons  will  be  made.  The  reader  should  remember  that 
subbasin  accuracy,  the  level  of  most  relevance  to  hydrology,  lies  somewhere 
between  these  two  extremes. 

With  these  qualifications  stated,  we  begin  the  land  use  comparison  analysis. 
Crow  Creek 

HEC  staff  learned  the  mechanics  of  the  UCD  Procedure  by  applying  it  for  the 
land  use  classification  of  Crow  Creek.  A  grid  cell  data  bank  for  Crow  Creek 
had  not  yet  been  constructed  by  the  Rock  Island  District  when  this  report 
was  completed.  Therefore,  only  the  LANDSAT  land  use  classification  can  be 
discussed . 

Crow  Creek,  a  predominately  rural  watershed  located  near  Davenport,  Iowa, 
has  a  drainage  area  of  18.4  square  miles.  Dates  and  specifications  of 
LANDSAT  imagery  and  ground  truth  data  are  listed  below. 

LANDSAT  Imagery: 

18  May  1976,  ID  #  2482-15585. 

Ground  Truth: 

NASA  high-altitude  color  infrared  aerial  photography,  30  Mar 
1973,  scale  =  1:125,000. 

USGS  7-1/2-minute  topographic  maps,  photorevised  1970  and 
1975,  scale  =  1:24,000. 

Mylar  copy  of  Rock  Island  District's  XFPI  base  map,  scale 
=  1:12,000,  (enlargement  of  topographic  maps  above  with 
watershed  boundary  outlined). 

Low-altitude  photo-mosaic  on  screended  Mylar,  July  1977,  scale 

=  1:12,000. 

Seven  land  use  classes  were  identified  from  the  LANDSAT  data:  agricultural, 
residential,  industrial/commercial ,  developed  open  space,  forest, 
undeveloped  open  space,  and  water.  Acreage  and  percent  distribution  in  the 
various  categories  are  given  in  Table  1,  and  illustrated  by  a  line  printer 
map  and  corresponding  topographic  map  in  Figure  2. 

Walnut  Creek 

Walnut  Creek  was  intended  to  provide  a  test  case  for  determining  the 
accuracy  of  LANDSAT  land  use.  It  was  also  a  watershed  that  was  to  be 
analyzed  both  using  the  UCD  Procedure  and  the  services  of  a  commercial 
vendor.  Although  conventional  land  use  had  been  determined  for  the  basin  by 


the  Fort  Worth  District,  a  calibrated  hydrologic  model  had  not  been 
completed  at  the  time  of  this  study.  Therefore,  only  land  use  and  not 
hydrologic  comparisons  could  be  made. 

Table  1 

Crow  Creek  LANDSAT  Land  Use 


Land  Use  Category 

Area  Table 

Acres 

% 

Agricultural 

7,333 

62.2 

Residential 

1,780 

15.1 

Indus trial /Commercial 

78 

0.7 

Developed  Open  Space 

480 

4.1 

Forest 

859 

7.3 

Undeveloped  Open  Space 

1,229 

10.4 

Water 

35 

0.3 

11,794 

100.1 

Walnut  Creek  is  located  near  Austin,  Texas  and  has  a  drainage  area  of  55 
square  miles.  Specifications  for  LANDSAT  imagery,  ground  truth  data  and 
conventional  land  use  used  in  the  analysis  are  listed  below. 

LANDSAT  imagery: 

-  3  May  1976,  ID  #2467-16194. 

Ground  Truth: 

-  NASA  high-altitude  color  infrared  aerial  photography,  21  Jan  1974, 
scale  =  1:121,000. 

-  USGS  7-1/2-minute  topographic  maps,  photorevised  1973,  scale 
=  1:24,000. 

Conventional  Land  Use: 

Interpreted  from  low-altitude  color  infrared  aerial  photography,  Oct 
1977,  scale  =  1:12,000. 

The  location  of  Walnut  Creek  on  the  LANDSAT  scene  is  shown  in  Figure  3.  The 
Walnut  Creek  conventional  land  use  consisted  of  19  land  use  categories, 
listed  in  Table  2  along  with  their  respective  acreage  and  percent 
distribution.  Figure  4  is  a  line  printer  map  of  the  same  classification. 


Table  2 


Walnut  Creek  Conventional  Land  Use 
Area  Table 


Land  Use  Category 

Acres 

% 

Low  density  residential 

1,785 

4.9 

Medium  density  residential 

3,671 

10.0 

High  density  residential 

156 

0.4 

Multifamily  residential 

619 

1.7 

Mobile  home  parks 

126 

0.3 

Strip  commercial 

264 

0.7 

Shopping  centers 

85 

0.2 

Institutional 

536 

1.5 

Industrial 

488 

1.3 

Industrial /commercial  complexes 

742 

2.0 

Public  use 

96 

0.3 

Transportation/ communication/utilities 

1,261 

3.4 

Barren  land/ quarry 

1,022 

2.8 

Cropland 

3,917 

10.7 

Pasture/ ran gel and 

11,327 

31.0 

Developed  open  space 

139 

0.4 

Undeveloped  urban  land 

1,135 

3.1 

Forest 

9,143 

25.0 

Water 

62 

0.2 

36,574 

99.9 

Using  the  UCD  Procedure  six  classes  of  land  use  were  identified  from  the 
LANDSAT  imagery:  cropland/pasture,  residential,  commercial/industrial, 
quarry,  f orest/rangeland,  and  water.  Acreage  and  percent  distribution  for 
the  LANDSAT  classification  are  given  in  Table  3,  as  is  a  line  printer  map  in 
Figure  5. 

Table  3 


Walnut  Creek  UCD  LANDSAT  Land  Use 
Area  Table 


Land  Use  Category 

Acres 

1 

Cropland/Pasture 

15,298 

41.8 

Residential 

9,135 

25.0 

Conner i cal /Indus trial 

1,861 

5.1 

Quarry 

223 

0.6 

Forest/Rangeland 

9,988 

27.3 

Water 

70 

0.2 

36,575 

100.0 
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The  LANDSAT  land  use  classification  was  entered  into  a  grid  cell  data  bank 
that  had  previously  been  constructed  by  the  Fort  Worth  District  for  an 
Expanded  Flood  Plain  Information  Study  (XFPI)  of  the  Walnut  Creek  Basin.  The 
grid  cell  size  of  the  Walnut  Creek  data  bank  is  200  feet  (east-west)  by  250 
feet  (north-south),  or  1.148  acres.  Conventional  land  use  was  also  a  data 
bank  variable.  The  simultaneous  existance  in  the  data  bank  of  land  use 
classified  by  LANDSAT  and  conventional  means  permitted  an  exhaustive 
comparison  of  the  two  approaches. 

For  a  cell-by-cell  comparision  of  the  entire  watershed  it  was  necessary  to 
establish  an  explicit  mapping  of  conventional  land  use  categories  into  the 
fewer  LANDSAT  land  use  categories,  Table  4.  The  RIA  computer  program  (U.S. 
Army  Corps  of  Engineers,  1978b)  generated  the  coincident  matrix,  Table  5. 

The  structure  of  this  crosstabulation  table  is  similar  to  others  that  will 
be  presented  later  in  this  report.  Each  element  of  the  table  (row  and  column 
combination)  refers  to  all  grid  cells  within  the  watershed  data  bank  that 
have  the  concurrent  LANDSAT  and  conventional  land  use  specified  by  the  row 
and  column  headings  of  that  particular  element.  For  example,  the  2nd  row, 

1st  column  of  Table  5  refers  to  all  grid  cells  in  the  Walnut  Creek  data  bank 
that  are  classified  both  commercial/industrial  by  LANDSAT  and  residential  by 
the  conventional  classification.  For  each  element  of  the  table,  four  numbers 
are  given:  (1)  total  acreage  of  all  grid  cells  represented  by  the 
appropriate  joint  land  use  classification;  (2)  row  percent,  or  the  percent 
of  all  grid  cells  with  the  given  LANDSAT  classification  that  have  al so  the 
given  conventional  classification;  (3)  column  percent,  or  the  percent  of  all 
grid  cells  with  the  given  conventional  land  use  that  have  also  the  given 
LANDSAT  land  use;  and  (4)  total  percent,  or  the  percent  of  the  entire 
watershed  that  has  the  given  joint  land  use  classification.  Continuing  our 
example  above,  471  acres  were  found  to  be  classified  both  commercial/ 
industrial  by  LANDSAT  and  residential  by  conventional  means.  This  acreage 
represents  25.3%  of  all  the  area  (1861  ac)  that  was  classified  by  LANDSAT  as 
cotmnercial/industrial ,  7.4%  of  all  the  area  (6359  ac)  that  belonged  to  the 
conventional  residental  category,  and  1.3%  of  the  total  watershed  area 
(36,578  ac). 

The  far  right  column  (row  total)  and  the  bottom  row  (column  total)  of  Table 
5  give  the  marginal  distributions  of  LANDSAT  and  conventional  land  use, 
respectively.  These  represent  the  acres  and  percent  in  the  different  land 
use  categories  without  the  conditional  requirement  described  above  for  the 
body  of  the  table.  The  marginal  distributions  are  exactly  the  same 
information  as  previously  given  in  the  area  tables  (e.g.,  Tables  2  and  3), 
only  now  aggregated  according  to  the  land  use  category  mapping  (Table  4). 

In  interpreting  Table  5,  the  proportion  of  the  entire  watershed  represented 
by  the  grid  cells  that  appear  down  the  upper-left  to  lower-right  diagonal  of 
the  table,  52.7%,  can  be  thought  of  as  a  measurement  of  the  accuracy  of  the 
LANDSAT  classification.  A  strong  qualification  on  the  preceding  statement  is 
that  land  use  classes  are  not  completely  consistent  between  classifications; 
as  a  result,  even  if  LANDSAT  was  100%  "correct",  some  grid  cells  would 
appear  in  off-diagonal  table  elements.  A  second  complication  in  interpreting 
Table  5  is  the  inclusion  of  rangeland  in  the  "cropland/pasture"  conventional 
land  use  category,  and  in  the  "forest"  LANDSAT  land  use  category.  Being 
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unable  to  differentiate  between  them,  cropland,  pasture,  forest,  and 
rangeland  should  be  grouped  as  one  category.  Summing  the  diagonal 
percentages  would  then  result  in  70.8%  of  the  grid  cells  "correctly" 
classified  by  LANDSAT. 


Table  4 


Walnut  Creek 

Land  Use  Category  Mapping 


Conventional  Land  Use 

UCD  LANDSAT 

Battel le  LANDSAT 

Low  density  single  family  residential 
Medium  density  single  family  residential 
High  density  single  family  residential 
Multifamily  residential 

Mobile  home  parks 

residential 

residential 

Strip  comnercial 

Shopping  centers 

Institutional 

Industrial 

Industrial  and  commerical  complexes 

Public  Use:  cemetaries,  public  assembly 
areas,  waste  disposal  areas 
Transportation,  communication,  utilities 

commercial/ 

industrial 

industrial/ 

commercial 

transportation 

Barren  land/quarry 

barren  land/ 
quarry 

barren  land 
quarry 

Cropland 

Pasture/ rangeland 

Developed  open  space 

Undeveloped  urban  land 

cropl and/ 
pasture 

cropl and/ 
pasture 
rangeland 

Forest 

forest/ 

rangeland 

forest 

riparian  veg. 

Water 

water 

water 

Table  5 


Walnut  Creek  Land  Use  Comparison 
UCD  LANDSAT  vs.  CONVENTIONAL* 


Acres 

CONVENTIONAL 

X  Row 

CROP/PASTURE 

ROW 

I  Col 

X  Total 

RES 

COM/ IND 

QUARRY 

/RANGE 

FOREST 

WATER 

TOTAL 

3801 

1098 

225 

3147 

859 

6 

9136 

RES 

41.6 

12.0 

2.5 

34.4 

9.4 

0.1 

100.0 

59.8 

31.6 

22.0 

19. 1 

9.4 

9.5 

25.0 

10.4 

3.0 

0.6 

8.6 

2.3 

0.0 

25.0 

■flj 

816 

mm 

—y 

0 

1861 

COM/ IND 

43.8 

0.0 

100.0 

23.5 

5.1 

H 

2.2 

■a 

1.0 

5.1 

13 

61 

in 

17 

21 

■■ 

223 

UCD 

QUARRY 

5.8 

■Vv-*' 

49.8 

7.6 

9.4 

100.0 

LANDSAT 

0.2 

10.9 

0.1 

0.2 

SIPmsH 

0.6 

0.0 

■SI 

0.3 

0.0 

0.1 

Hi 

0.6 

1581 

1106 

313 

9  324 

2947 

28 

1  5229 

CROP/PASTURE 

10.3 

7.2 

2.0 

60.9 

19.3 

0.2 

24.9 

31 .8 

30.7 

56.4 

32.2 

44.4 

- 

4.3 

3.0 

0.9 

25.5 

8.1 

0.1 

I  y 

493 

361 

3655 

5233 

13 

9989 

FOREST/RANGE 

4.9 

3.6 

36.6 

52.4 

0.1 

100.0 

7.8 

10.4 

22.1 

57.2 

20.6 

27.3 

1.3 

1.0 

HU 

10.0 

14.3 

0.0 

27.3 

0 

31 

8 

9 

6 

mm 

70 

WATER 

0.0 

44.3 

11.4 

12.9 

8.6 

100.0 

0.0 

0.9 

0.8 

0.1 

0.1 

0.2 

0.0 

0. 1 

0.0 

0.0 

O 

o 

0.0 

0.2 

COLUMN 

6359 

34  73 

1021 

16518 

9144 

63 

36578 

TOTAL 

17.4 

9.5 

2.8 

45.2 

25.0 

0.2 

100.0 

100.0 

100.0 

100.0 

100.0 

100.0 

100.0 

100.0 

17.4 

9.5 

2.8 

45.2 

25.0 

0.2 

100.0 

*  Note:  See  page  15  of  text  for  explanation  of  table  structure 


The  above  comparison  at  the  grid  cell  level  is  nearly  equivalent  to  a 
comparison  at  the  LANDSAT  pixel  scale.  An  alternative  comparison,  less 
sensitive  to  geometric  correction  and  resampling  errors,  is  of  the  major 
land  use  categories  at  the  watershed  level.  Looking  at  the  total  percent 
classified  as  (a)  residential,  (b)  commercial/industrial,  and  (c) 
cropland/pasture/f orest/rangeland  by  conventional  and  LANDSAT  shows  a 
difference  of  7.6%,  -4.4%  and  -1.1%,  respectively.  The  average  absolute 
difference  for  the  major  land  use  categories  is  4.4%. 

Rowlett  Creek 

Rowlett  Creek  is  a  137  square  mile,  predominately  rural  watershed  located 
near  Dallas,  Texas.  Rowlett  C  -eek  was  classified  from  LANDSAT  data  by  UCD 
staff  and  the  results  reported  by  Algazi  (1978).  HEC  entered  the  LANDSAT 
classification  into  a  grid  cell  data  bank  (previously  created  by  the  Fort 
Worth  District)  and  compared  it  with  conventional  land  use.  Specifications 
on  the  imagery,  ground  truth,  and  conventional  land  use  are  given  below. 


LANDSAT  Imagery: 

-  3  May  1976,  ID#  2467-16185. 

Ground  Truth: 

-  NASA  high-altitude  color  infrared  aerial  photography,  May  1972, 
scale  =  1: 102,000. 

USGS  7-1/2-minute  topographic  maps,  photorevised  1968  and  1973, 
scale  =  1:24,000. 

Conventional  Land  Use: 

Interpreted  from  low-altitude  black/white  aerial  photography,  March 
1976,  scale  =  1:24,000. 

Area  tables  for  the  LANDSAT  and  conventional  classifications  appear  as 
Tables  6  and  7,  respectively.  Corresponding  line  printer  maps!  are  given 
in  Figures  6  and  7.  Land  use  category  mapping  from  the  larger  number  of 
conventional  classes  into  the  LANDSAT  classes  is  listed  in  Table  8.  Finally, 
Table  9  contains  the  LANDSAT  -  conventional  land  use  comparison  results. 


1  Common  printer  symbols  were  used  for  the  same  land  use  category  in  both 
classifications.  This  practice  was  continued  for  all  similar  pairs  of  land 
use  maps  presented  in  following  sections. 


Table  6 


1  , 

1  Rowlett 

Creek  LANDSAT  Land  Use 

’■  1 

I 

Area 

Table 

Land  Use  Category 

Acres 

% 

’  1  Agricultural 

62,370 

70.8 

<  Natural  Vegetation 

4,659 

5.3 

|  Residential 

17,613 

20.0 

'  '  Commercial/Industrial 

3,409 

3.9 

88,051 

100.0 

4 

Table  7 

Ll  Rowlett  Creek  Conventional  Land 

Use 

y 

Area 

Table 

Land  Use  Category 

k'  ■» 

Acres 

% 

k  Low  density  residential 

1,200 

1.4 

j  Medium  density  residential 

7,810 

8.9 

k,  High  density  residential 

]  Multifamily  residential 

92 

0.1 

201 

0.2 

*|  Mobile  home  parks 

84 

0.1 

i  Developing 

3,554 

4.0 

^  Central  Business  District 

128 

0. 1 

Strip  Commercial 

346 

0.4 

Shopping  Centers 

167 

0.2 

)  Institutional  -  School,  Churches 

716 

0.8 

Industrial 

820 

0.9 

t  Transportation;  Major  Highways 

1,473 

1.7 

Communication 

5 

0.0 

Public  Utilities 

168 

0.2 

Strip  Settlement 

199 

0.2 

Parks  &  Developed  Open  Space 

1,067 

1.2 

'  Cropland 

31,241 

35.5 

Grassland 

30,985 

35.2 

J  Confined  Feeding 

75 

0.1 

W  Barren  Land,  Gravel  Pits 

738 

0.8 

|H  Woodlands,  Forest 

6,869 

7.8 

jT  Water  Bodies 

116 

0.1 

t 

88,054 

99.9 

1 ■  Two  rows  in  Table  9  are  blank.  Although 

identified  as 

target  land  use 

classes,  UCD  staff  were  "unable  to  make  any  assignments  to  the  categories 
developed  open  space  and  water  bodies"  (Algazi,  1978)  because  very  few  land 
areas  representing  these  categories  could  be  located  on  the  aerial 
photographs.  As  a  summary  measure  of  accuracy,  the  diagonal  summation  shows 


« 

I 
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Table  8 


Rowlett  Creek 
Land  Use  Category  Mapping 


Conventional  Land  Use 

LANDSAT  Land  Use 

Low  Density  Residential 

Medium  Density  Residential 

High  Density  Residential 

Multifamily  Residential 

Mobile  Home  Parks 

Developing 

Residential 

Central  Business  District 

Strip  Commercial 

Shopping  Centers 

Institutional  -  School,  Churches 
Industrial 

Transportation,  Major  Highways 

Communi cat i on 

Public  Utilities 

Strip  Settlement 

Comnercial/ Indus trial 

Cropland 

Grassland 

Confined  Feeding 

Barren  Land,  Gravel  Pits 

Agricultural 

Woodlands,  Forest 

Natural  Vegetation 

Parks  and  Developed  Open  Space 

Developed  Open  Space 

Water  Bodies 

Water 

Table  9 

Rowlett  Creek  Land  Use  Comparison 
LANDSAT  vs.  Conventional* 


67.4%  of  the  grid  cells  "correctly"  classified  by  LANDSAT.  The  four  major 
land  use  categories  are  (a)  residential,  (b)  commercial/industrial ,  (c) 
agricultural,  and  (d)  natural  vegetation;  corresponding  percent  differences 
between  conventional  and  LANDSAT  are  5.3%,  -0.7%,  -0.8%,  and  -2.5%, 
respectively.  Average  absolute  error  for  the  major  land  use  classes  is  2.3%. 

Pennypack  Creek 

Pennypack  Creek  is  a  55.8  square  mile  basin  that  drains  part  of 
Philadelphia,  Pennsylvania.  UCD  staff  determined  land  use  from  LANDSAT 
(Algazi,  1978)  and  HEC  entered  the  classification  in  a  grid  cell  data  bank 
(constructed  by  the  Philadelphia  District)  and  made  the  land  use  comparison. 
Details  of  the  LANDSAT  imagery,  ground  truth  data,  and  conventional  land  use 
are  listed  below. 

LANDSAT  Imagery: 

-  19  April  1976,  ID#  2453-14565. 

Ground  Truth: 

-  NASA  high  altitude  color  infrared  aerial  photography,  5  Feb  1974, 
scale  =  1:131,000. 

-  NASA  high  altitude  color  infrared  aerial  photography,  28  Aug  1975, 
scale  =  1:57,000. 

-  USGS  7-1/2-minute  topographic  maps,  photorevised  1973, 
scale  *  1:24,000. 

Conventional  Land  Use: 

1975  tax  maps  for  Bucks  County,  scale:  1"  *  400'. 

-  1976  tax  maps  for  Philadelphia,  scale:  1"  =  200'. 

1975  aerial  photography  for  Montgomery  County,  scale:  1"  *  400' . 

-  Entire  basin  verified  against  1977  aerial  photography,  scale: 

1"  =  400'. 

Area  tables  for  LANDSAT  and  conventional  classifications  are  shown  in  Tables 
10  and  11,  respectively.  Corresponding  line  printer  maps  are  given  in 
Figures  8  and  9.  Land  use  category  mapping  between  the  two  classifications 
is  listed  in  Table  12,  and  grid  cell  comparison  results  in  Table  13. 


Table  10 


Pennypack  Creek  LANDSAT  Land  Use 
Area  Table 


Land  Use  Category 

Acres 

% 

Natural  vegetation 

5,660 

15.9 

Pasture 

1,884 

5.3 

Agriculture 

1,076 

3.0 

Developed  Open  Space 

1,763 

4.9 

Residential 

24,405 

68.4 

Light  Industry 

182 

0.5 

Intermediate  Industry 

457 

1.3 

Heavy  Industry 

165 

0.7 

Water 

0 

0.0 

35,693 

100.0 

Table  11 


Pennypack  Creek  Conventional  Land  Use 
Area  Table 


Land  Use  Category 

Acres 

% 

Forest  and  Undeveloped 

7,499 

21.0 

Agriculture 

3,341 

9.4 

Recreation  and  Cultural 

1,832 

5.1 

Single  -  High  Value 

6,186 

17.3 

Single  -  Low  Value 

6,747 

18.9 

Twin  -  High  Value 

2,343 

6.6 

Twin  -  Medium  Value 

116 

0.3 

Twin  -  Low  Value 

54 

0.2 

Apartments  -  High  Value 

705 

2.0 

Apartments  -  Low  Value 

781 

2.2 

Community  Services  -  High  Value 

24 

0.1 

Community  Services  -  Low  Value 

1,296 

3.6 

Light  Industry 

619 

1.7 

Communications  and  Utilities 

154 

0.4 

Military 

1 

0.0 

Commercial  -  High  Value 

861 

2.4 

Commercial  -  Low  Value 

981 

2.7 

Mining 

35 

0.1 

Heavy  Industry 

922 

2.6 

Transportation 

910 

2.5 

Water  Bodies 

289 

0.8 

35,693  99.9 
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Figure  9 
Pennypack  Creek 
Conventional  Land  Use 


1 


Table  12 


Pennypack  Creek 
Land  Use  Category  Mapping 


Conventional  Land  Use 

LANDSAT 

Forest  and  Undeveloped 

Natural  Vegetation 

Agriculture 

Pasture 

Agriculture 

Recreation  and  Cultural 

Developed  Open  Space 

Single  -  High  Value 

Single  -  Low  Value 

Twin  -  High  Value 

Twin  -  Medium  Value 

Twin  -  Low  Value 

Apartments  -  High  Value 

Apartments  -  Low  Value 

Community  Services,  -  High  Value 
Community  Services  -  Low  Value 

Residential 

Light  Industry 

Communications  and  Utilities 

Military 

Light  Industry 

Conmercial  -  High  Value 

Commercial  -  Low  Value 

Mining 

Intermediate  Industrj 

Heavy  Industry 

Transportation 

Heavy  Industry 

Water  Bodies 

Water 

A 
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Pennypack  Creek  Land  Uae  Comparison 
LANDSAT  va .  Conventional* 


♦  Hote;  See  page  15  of  teat  for  explanation  of  table  structure 


The  most  noticable  conflicts  in  Table  13  occur  between  natural  vegetation 
and  residential,  and  between  agricultural  and  residential.  Diagonal 
summation  shows  50.8%  of  the  watershed  "correctly"  classified  by  LANDSAT. 

The  major  land  use  categories  are  (a)  natural  vegetation,  (b)  agricultural, 
(c)  developed  open  space,  (d)  residential,  and  (e)  industrial;  their 
corresponding  percent  differences  between  conventional  and  LANDSAT  are 
-5.1%,  -1.1%,  -0.2%,  17.3%,  and  -10.1%,  respectively.  Average  absolute  error 
for  the  major  categories  is  6.8%. 

Castro  Valley 

Castro  Valley  is  a  small  urban  watershed  (drainage  area  =5.7  mi2)  located 
in  the  San  Francisco  Bay  area.  LANDSAT  land  use  had  been  determined  by  UCD 
(Algazi,  1977a)  and  entered  into  a  grid  cell  data  bank  by  HEC.  The 
conventional  land  use  classification,  and  the  data  bank  itself,  were 
determined  by  HEC  staff  for  use  in  Corps  of  Engineers  training  activities. 
Sources  of  data  for  LANDSAT  analysis,  ground  truth,  and  conventional  land 
use  are  listed  below. 

LANDSAT  Imagery: 

-  30  Jan  1975,  ID#  1921-18022. 

Ground  Truth: 

-  NASA  high-altitude  color  infrared  aerial  photography,  date  and  scale 
unknown . 

USGS  7-1/2-minute  topographic  map,  photorevised  1973,  scale 
=  1:24,000. 

Conventional  Land  Use: 

USGS  7-1/2-minute  topographic  map,  photorevised  1973,  scale 
=  1:24,000. 

-  Alameda  County  Flood  Control  and  Water  Conservation  District,  Castro 
Valley  Drainage  Map,  date  unknown,  scale:  1"  =  500'. 

Area  tables  for  LANDSAT  and  conventional  appear  together  in  Table  14,  and 
line  printer  maps  are  given  separately  in  Figures  10  and  11.  Only  a  small 
amount  of  mapping  conventional  categories  into  LANDSAT  categories  was 
necessary,  as  shown  in  Table  15.  The  grid  cell  comparison  of  land  use  is 
shown  in  Table  16. 
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Table  14 

Castro  Valley 
Area  Tables 


LANDSAT  Land  Use 

Acres 

% 

Natural  Vegetation 

852 

23.3 

Developed  Open  Space 

25 

0.7 

Residential 

2,151 

58.9 

Commercial /Indus trial 

627 

17.2 

3,655  100.1 


Conventional  Land  Use 

Acres 

% 

Natural  Vegetation 

617 

16.9 

Developed  Open  Space 

165 

4.5 

Low  Density  Residential 

520 

14.2 

Med  Density  Residential 

1,763 

48.2 

High  Density  Residential 

301 

8.2 

Commercial 

171 

4.7 

Institutional 

109 

3.0 

Water 

9 

0.3 

3,655 


100.0 


Table  15 

Castro  Valley 
Land  Use  Category  Mapping 


Conventional  Land  Use 

LANDSAT  Land  Use 

Natural  Vegetation 

Natural  Vegetation 

Developed  Open  Space 

Developed  Open  Space 

Low  Density  Residential 

Med  Density  Residential 

High  Density  Residential 

Residential 

Commercial 

Institutional 

Commercial /Indus trial 

Water 

Water 
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Figure  11 
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X  Col 
X  Total 


Castro  Valley  Land  Use  Comparison 
LANDSAT  va.  Conventional* 


CONVENTIONAL 


IAT  VEG 

DEV  OP  SP 

RES  i 

COM/IND  1 

WATER 

476 

55 

312 

9 

0 

55.8 

6.5 

36.6 

1.1 

0.0 

77.2 

33.3 

12.1 

3.3 

0.0 

13.0 

1.5 

8.5 

0.3  1 

0.0 

The  largest  conflicts  in  Table  16  occur  between  the  conventional  residential 
class  and  LANDSAT  natural  vegetation,  developed  open  space,  and 
commercial/industrial  categories.  Diagonal  summation  shows  67.0%  of  the 
watershed  "correctly"  classified  by  LANDSAT.  The  four  major  classes  in  Table 
16  are  (a)  natural  vegetation,  (b)  developed  open  space,  (c)  residential, 
and  (d)  commercial/industrial;  LANDSAT  -  conventional  differences  are  6.4%, 
-3.8%,  -11.8%,  and  9.5%,  respectively.  Average  absolute  error  for  the  major 
land  use  categories  is  7.9%. 

Summary 

Results  from  the  comparison  of  LANDSAT  and  conventional  land  use  for  four 
watersheds  (Walnut  Creek,  Rowlett  Creek,  Pennnypack  Creek,  and  Castro 
Valley)  have  been  presented.  A  summary  of  the  comparison  at  the  grid  cell 
level  is  given  in  Table  17,  and  at  the  watershed  level  in  Table  18.  A 
general  pattern  in  the  error  evaluation  is  apparent:  individual  grid  cells 
were  incorrectly  classified  by  LANDSAT  1/3  of  the  time,  whereas  aggregation 
of  grid  cells  over  the  entire  watershed  showed  misclassification  of  the 
major  land  use  categories  averaging  2  to  8%. 


Table  17 

Accuracy  of  Landsat  Land  Use  Classification 
Grid  Cell  Level 


WATERSHED 

Walnut 
Rowlett 
Pennypack 
Castro  Valley 


Drainage  Area  (mi 2) 

55 
137 

56 
5.7 


ACCURACY  (%) 

52.7/70.8 
67.4 
50.8 
67.0 
=  61.7 


COMNERCIAL  LANDSAT  CLASSIFICATION 


In  addition  to  learning  and  using  the  UCD  Procedure,  HEC  staff  had  the 
opportunity  to  participate  in  the  LANDSAT  land  use  classification 
operations  at  two  commercial  firms:  The  Bendix  Corporation  and  Battelle 
Pacific  Northwest  Laboratories^.  Having  first-hand  knowledge  of  what  is 
available  commercially  would  allow  a  more  comprehensive  evaluation  of  the 
UCD  Procedure. 


Battelle  Pacific  Northwest  Laboratories 
Battelle  Boulevard 
Richmond,  Washington  99352 
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Table  18 


LANDSAT  Land  Use  Classification  Error  (%) 
Watershed  Level 


WALNUT 

Conv 

LANDSAT 

diff 

ROWLETT 

Conv 

LANDSAT 

diff 

Res 

17.4 

25.0 

7.6 

Res 

14.7 

20.0 

5.3 

Com/Ind 

9.5 

5.1 

-4.4 

Com/Ind 

4.6 

3.9 

-0.7 

Crop/Pasture/ 

70.2 

69.1 

-1.1 

Ag 

71.6 

70.8 

-0.8 

Fores t/Range 

Nat  Veg 

7.8 

5.3 

-2.5 

Avg.  Abs 

=  4.4 

Avg.  Abs  = 

2.3 

PENNYPACK 

Conv 

LANDSAT 

diff 

CASTRO  VLY 

Conv 

LANDSAT 

diff 

Nat  Veg 

21.0 

15.9 

-5.1 

Nat  Veg 

16.9 

23.3 

6.4 

Ag 

9.4 

8.3 

-1.1 

DevOpSp 

4.5 

0.7 

-3.8 

DevOpSp 

5.1 

4.9 

-0.2 

Res 

70.7 

58.9 

-11.8 

Res 

51.1 

68.4 

17.3 

Com/Ind 

7.7 

17.2 

9.5 

Ind 

12.6 

2.5 

-10.1 

Avg.  Abs 

=  6.8 

Avg.  Abs 

=  7.9 

Bendix 

HEC  staff  and  the  UCD  principal  investigator  visited  the  Bendix  facilities 
in  Ann  Arbor,  Michigan.  (Bendix  operations  have  since  been  taken  over  by  the 
Environmental  Research  Institute  of  Michigan!.)  Land  use  for  a  1,064 
square  mile  area  was  classified  from  LANDSAT  imagery  by  selecting 
representative  training  sites  throughout  the  San  Francisco  Bay  area.  The 
final  classification  was  checked  carefully  in  Castro  Valley. 

Output  products  included  color-coded  land  use  maps  and  a  resampled  computer 
tape2  for  selected  portions  of  the  classified  area.  The  land  use 
classification  of  Castro  Valley  was  compared  (at  the  watershed  level)  with 
other  published  commercial  classifications  of  the  basin  in  HEC's 
contribution  to  an  ASVT  interim  report  (Salmonson,  1977). 


1  ERIM 

Box  8618,  Ann  Arbor, 

Michigan  48107 

2  HEC  has  been  unable  to  read  the  resampled  tape,  as  it  was  not 
constructed  to  contract  specifications. 
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Battelle 


HEC  and  Fort  Worth  District  staff  participated  in  the  LANDSAT  classification 
of  Walnut  Creek  at  Battelle 's  Pacific  Northwest  Laboratories  in  Richland, 
Washington.  Training  sites  were  selected  from  the  same  LANDSAT  scene  that 
had  been  previously  analyzed  with  the  UCD  Procedure  at  HEC  (as  described 
earlier  in  this  report  in  the  Walnut  Creek  discussion).  The  final 
classification  was  identified  in  a  color-coded  land  use  map,  Figure  12,  and 
on  a  resampled  computer  tapel.  Table  19  contains  the  acreage2  and 
percent  distribution  of  Battelle' s  LANDSAT  land  use.  The  mapping  of  the 
Battelle  categories  into  conventional  and  UCD  LANDSAT  categories  is  shown  in 
Table  4.  Figure  13  is  a  line  printer  map  of  Battelle's  land  use. 

The  resampled  Battelle  tape  was  entered  into  the  Walnut  Creek  grid  cell  data 
bank.  Comparisons  between  Battelle  LANDSAT  and  conventional,  and  between 
Battelle  LANDSAT  and  UCD  LANDSAT  are  shown  in  Tables  20  and  21, 
respectively.  Summary  measures  of  comparison  are  provided  in  Table  22. 

Table  19 

Walnut  Creek  Battelle  LANDSAT  Land  Use 
Area  Table 


Land  Use  Category  Acres  % 


Barren  Land 

1,200 

3.5 

Quarry 

217 

0.6 

Industrial /Commerical 

611 

1.8 

Residential 

6,513 

18.8 

Cropland/Pasture 

9,536 

27.5 

Rangel  and 

8,597 

24.8 

Riparian  Vegetation 

4,663 

13.4 

Forest 

1,630 

4.7 

Water 

54 

0.2 

Transportation 

1,676 

4.8 

34,697  100.1 


^  After  the  tape  had  been  provided  to  HEC,  it  was  discovered  that  the 
wrong  corner  grid  cell  was  used  in  the  resampling.  A  corrected  tape  is 
reported  to  be  forthcoming  from  Battelle.  All  land  use  comparisons  of 
the  Battelle  classification  will  be  affected  by  this  known  resampling 
error. 

2  The  total  classified  land  area  listed  on  Battelle's  color-coded  map 
was  35,869  acres  (56.05  sq.  mi.).  When  the'Battelle  resampled  file  was 
entered  into  the  Walnut  Creek  grid  cell  data  bank  only  34,697  acres 
(54.21  sq.  mi.)  were  located  within  the  geographically  correct 
watershed  boundary,  the  latter  being  defined  in  the  data  bank  as 
containing  36,574  acres  (57.15  sq.  mi.).  This  discrepancy  was  probably 
due  to  the  resampling  problems  described  in  the  footnote  above. 
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Table  20 


Valnut  Creek  Land  Use  Comparison 
Battelle  LANDSAT  va.  Conventional* 


Walnut  Creek  Land  Use  Comparison 
Bat  telle  Land sat  vs.  UCD  LANDSAT* 


HfB 

Sj  UCD  LANDSAT 

ROW 

TOTAL 

RES 

COM/IND 

CROP/PASTURE 

FOREST/ 

RANGE 

water 

SB 

607 

32 

1839 

812 

5 

6513 

RES 

9.3 

0.5 

12.5 

0.1 

100.0 

KEfll 

36.1 

15.8 

8.6 

18.8 

K9 

1.7 

0.1 

■I 

2.3 

H 

18.8 

816 

212 

15 

■ 

395 

15 

2287 

COH/IND 

35.7 

9.3 

17.3 

0.7 

100.0 

9.3 

12.6 

4.2 

21.7 

6.6 

2.4 

0.6 

■ 

1 

1.1 

Hi 

6.6 

392 

116 

17 

566 

325 

i 

1417 

QUARRY 

27.7 

8.2 

1.2 

39.9 

22.9 

O.l 

100.0 

4.5 

6.9 

8.4 

3.9 

3.4 

1.4 

4.1 

1.1 

0.3 

0.0 

1.6 

0.9 

m 

4.1 

1734 

326 

31 

■m 

m 

15 

9537 

CROP/PASTURE 

18.2 

3.4 

0.3 

■ 

mmm 

0.2 

100.0 

19.8 

19.4 

15.3 

HUB 

21.7 

27.5 

5.0 

0.3 

0.1 

Weak 

0.0 

27.5 

2585 

419 

107 

6162 

5587 

30 

14890 

forest/aahce 

17.4 

2.8 

0.7 

41.4 

37.5 

0.2 

100.0 

29.5 

24.9 

42.3 

59.3 

43.5 

42.9 

7.5 

1.2 

17.8 

16.1 

0.1 

42.9 

9 

1 

0 

20 

21 

3 

WATER 

16.7 

1.9 

0.0 

37.0 

38.9 

5.6 

0.1 

0.1 

0.0 

0.1 

0.2 

4.3 

■Sgfl 

0.0 

0.0 

0.0 

0.1 

0.1 

0.0 

mm 

COLUMN 

8754 

1681 

202 

14570 

9422 

69 

34698 

TOTAL 

25.2 

4.8 

0.6 

42.0 

27.2 

0.2 

100.0 

100.0 

100.0 

100.0 

100.0 

100.0 

100.0 

100.0 

25.2 

4.8 

0.6 

42.0 

27.2 

0.2 

100.0 

NOTE:  See  page  10  of  text  for  explantation  of  table  structures 


Table  22 


Walnut  Creek  Battelle  LANDSAT  Land  Us 
Error  Summary 


Battelle  LANDSAT  vs.  Conventional 


(refer  to  TABLE  20) 

GRID  CELL  LEVEL: 

Diagonal  summation  =  44.3% 


WATERSHED  LEVEL: 


Battelle 

Conv. 

Res . 

18.8 

17.2 

Com/ Ind 

6.6 

8.9 

Crop/Pasture/Range 

52.3 

45.7 

Forest 

18.1 

25.2 

Avg.  Abs 


Battelle  LANDSAT  vs.  UCD  LANDSAT 
(refer  to  TABLE  21) 

GRID  CELL  LEVEL: 

Diagonal  Summation  =  40.8% 


WATERSHED  LEVEL: 


Battelle 

UCD 

Res . 

18.8 

25.2 

Com/ Ind 

6.6 

4.8 

Crop/Pasture 

27.5 

42.0 

Forest/Range 

42.9 

27.2 

1 


3 

! 

The  Battelle  LANDSAT  and  UCD  LANDSAT  demonstrated  similar  magnitudes  of  ! 

classification  error  at  the  watershed  level,  both  having  an  average  absolute 
error  of  4.4%  for  the  major  land  use  classes.  At  the  grid  cell  level  UCD 
LANDSAT  was  more  accurate,  having  "correctly"  classified  52.7  to  70.8%  of 
the  grid  cells  compared  to  Battelle's  44.3%. 

COST  OF  LANDSAT  CLASSIFICATION 

As  part  of  the  overall  ASVT  project,  Water  Resources  Engineers l  (WRE)  is 
responsible  for  a  cost-effectiveness  study  of  the  LANDSAT  determination  of 
land  use.  HEC  kept  careful  time  and  cost  records  during  the  classification 
of  Crow  Creek  and  Walnut  Creek  (using  the  UCD  Procedure).  The  cost  data 
provided  to  WRE  is  repeated  in  Table  23.  Five  weeks  of  engineer/ technician 
labor  were  required  for  each  watershed. 

Similarly,  cost  estimates  for  the  two  commercial  classifications  (Bendix  and 
Battelle)  are  given  in  Table  24. 

HYDROLOGIC  COMPARISON 

The  primary  reason  for  examining  the  land  use  classification  ability  of 
LANDSAT  was  for  its  potential  application  to  hydrologic  modeling. 

Calibration  of  hydrologic  models  typically  used  by  the  Corps  of  Engineers  in 
urban  areas  is  heavily  dependent  on  land  use  data,  particularly  in  basins 
where  land  use  is  changing  and  where  future  conditions  are  of  interest. 

The  computer  program  HEC-1  (U.S.  Army  Corps  of  Engineers,  1973)  has  the 
capability  of  explicitly  relating  land  use  to  runoff  using  two  procedures: 

Snyder's  unit  hydrograph  with  percent  imperviousness,  and  the  SCS  curve 
number  and  unit  hydrograph  (U.S.  Soil  Conservation  Service,  1972).  The 
HYDPAR  program  (U.S.  Army  Corps  of  Engineers,  1978d)  obtains  the  necessary 
information  from  a  grid  cell  data  bank  and  computes  the  specified  hydrologic 
parameters,  which  are  in  turn  input  into  an  HEC-1  model  of  the  basin. 

HYDPAR  contains  a  regression  equation  formulation  of  Snyder's  lag  as  a 
function  of  steam  length,  length  to  centroid  of  subbasin,  stream  slope,  and 
percent  imperviousness,  Figure  14.  A  table  associating  a  percent 
imperviousness  with  each  land  use  category  in  the  data  bank  enables  HYDPAR 
to  compute  subbasin  percent  imperviousness  from  subbasin  land  use 
distribution. 


*  Water  Resources  Engineers 
8001  Forbes  Place, 
Springfield,  Virginia  22151 
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Table  23 


UCD  Procedure 
Cost  Estimates  ($) 


Crow  Creek 


Computer 

Direct 

and 

Total 

Task 

Labor* 

Supplies 

Direct 

Indirect 

Preclassification 

173 

518 

691 

135 

Geometric  Registration 

173 

282 

455 

135 

Clustering 

773 

161 

935 

603 

Reclustering 

459 

111 

569 

358 

Postclassification 

312 

92 

404 

243 

1,890 

1,163 

3,053 

1,474 

Total  $4,527 


Walnut  Creek 


Computer 

Direct 

and 

Total 

Task 

Labor* 

Supplies 

Direct 

Indirect 

Preclassification 

145 

366 

511 

113 

Geometric  Registration 

405 

113 

518 

316 

Clustering 

616 

146 

763 

481 

Reclustering 

516 

46 

562 

402 

Postclassification 

266 

71 

337 

207 

1,947 

743 

2,690 

1,519 

Total  $4,209 


*  Base  salary  rate  was  $9.26  per  hour 


Table  24 


Commercial  Classification 
Cost  Estimates 


BENDIX  (1977) 


Contract  Administration  $  600 
Contract  $5,607 
Travel  $1,800 
LAND SAT  CCT  $  400 
Aerial  Photography  $  200 


TOTAL  $8,607 

BATTELLE  (1979) 

Contract  Administration  $1,000 

Contract  $2,979 

Travel  $2,280 

LAND SAT  CCT  $  200 

Aerial  Photography  $  160 


TOTAL  $6,619 


Figure  14 

Calculation  of  Snyder's  Lag  (TP) 

TP  (hours)  =  (C)  (X)C1(10)(C2)I 

Where  - 

C  =  Regression  constant  =  .927600 

X  =  (L)  (Z)  (S)'*5 

L  =  Stream  length  (miles) 

Z  =  LCA  (Length  to  centroid  in  miles) 
S  =  Stream  slope  (feet/mile) 

Cl  =  Regression  coefficient  =  .387600 

C2  =  Regression  coefficient  =  -.003735 

I  =  Percent  imperviousness 
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In  a  similar  manner  HYDPAR  can  determine  the  SCS  unit  hydrograph 
parameter  from  stream  length,  basin  average  land  slope,  and  subbasin 
average  cuve  number,  Figure  15.  Curve  numbers  represent  an  empirical 
relationship  between  hydrologic  soil  type,  land  use,  and  their 
resultant  runoff  potential.  From  a  table  identifying  a  curve  number 
with  each  combination  of  land  use  and  hydrologic  soil  type,  HYDPAR 
computes  subbasin  average  curve  number. 

Figure  15 

Calculation  of  SCS  Lag 


(l)°-8(s+D0-7 


LAG  (hours)  = 


(1900XY) 


0.5 


Where  - 

L  =  The  hydraulic  length  of  watershed  in  feet 

Y  =  Average  subbasin  land  slope  in  percent 

S  =  (1000/CN)-10 

where  CN  is  the  curve  number 


Rowlett  Creek 

Percent  imperviousness  was  assigned  to  Rowlett  Creek's  LANDSAT  and 
conventional  land  use  categories,  Tables  25  and  26,  respectively.  In 
addition,  two  variations  on  the  conventional  land  use  percent 
imperviousness  were  made:  (1)  imperviousness  values  for  the  LANDSAT 
categories  were  assigned  to  the  aggregated  conventional  land  use 
categories,  Table  27;  and  (2)  the  "developing"  land  use  category  of 
Table  26  was  changed  to  41%  imperviousness,  everything  else  remaining 
the  same.  The  first  variation  was  intended  to  test  the  sensitivity  of 
the  hydrologic  model  to  aggregation  of  land  use  categories  by  comparing 
the  full  22  category  conventional  land  use  results  with  those  generated 
from  a  reduced  6  category  conventional  classification.  The  second 
variation  was  designed  to  see  how  hydrologic  parameter  estimates  would 
change  when  the  "developing"  category  was  equated  to  "medium  density 
residential" . 


Table  25 


Rowlett  Creek  LANDSAT  Land  Use 
Percent  Imperviousness  Summary 


LAND  USE  PERCENT 


CATEGORY 

TITLE 

IMPERVIOUSNESS 

1 

Agricultural 

2 

2 

Natural  Vegetation 

0 

3 

Residential 

41 

4 

Commerical /Indus trial 

85 

LAND  USE 
CATEGORY 

Table  26 

Rowlett  Creek  Conventional  Land  Use 

Percent  Imperviousness  Summary 

PERCENT 

TITLE  IMPERVIOUSNESS 

1 

Low  Density  Residential 

25 

2 

Medium  Density  Residential 

41 

3 

High  Density  Residential 

47 

4 

Multifamily  Residential 

70 

5 

Mobile  Home  Parks 

20 

6 

Central  Business  District 

95 

7 

Strip  Comnercial 

90 

8 

Shopping  Centers 

95 

9 

Institutional,  School,  Church, 

etc. 

40 

10 

Industrial 

80 

11 

Transportation,  Major  Highways 

35 

12 

Communication,  Transformers 

35 

13 

Public  Utilities 

60 

14 

Strip  Settlement 

10 

15 

Parks  and  Developed  Open  Space 

6 

16 

Devloping 

15 

17 

Cropl and 

3 

18 

Grassland 

0 

19 

Confined  Feeding 

20 

20 

Woodlands,  Forest 

0 

21 

Water  Bodies 

100 

22 

Barren  Land,  Gravel  Pits 

0 

47 


Table  27 


Rowlett  Creek  Aggregated  Conventional  Land  Use 
Percent  Imperviousness  Summary 


LAND  USE 

CATEGORY  TITLE 


PERCENT 

IMPERVIOUSNESS 


1 

2 

3 

4 

5 

6 


Agricultural 
Natural  Vegetation 
Residential 
Commerical /Indus trial 
Developed  Open  Space 
Water 


2 

0 

41 

85 

6 

100 


A  calibrated  HEC-1  model  of  a  24.6  square  miles  portion  of  the  Rowlett  Creek 
basin,  referred  to  as  Upper  Spring  Creek  and  outlined  in  Figure  16,  was  used 
to  simulate  runoff  from  selected  recurrence  interval  rainfall.  HYDPAR 
computed  percent  imperviousness  and  Snyder's  lag  for  Upper  Spring  Creek's 
twenty-three  subbasins.  Four  sets  of  each  parameter  are  shown  in  Table  28: 
(1)  conventional  22  category  with  "developing"  =  15%  impervious;  (2) 
conventional  22  category  with  "developing"  =  41%  impervious;  (3) 
conventional  6  category;  and  (4)  LANDSAT.  Percent  imperviousness  is 
generally  consistent  between  conventional  22  category  and  LANDSAT.  When 
substantial  differences  exist  between  the  two  it  can  usually  be  explained  by 
the  "developing"  percent  imperviousness  or  the  aggregation  effect.  Because 
percent  imperviousness  is  only  one  of  several  factors  which  determine 
Snyder's  lag  (see  Figure  14),  differences  between  conventional  and  LANDSAT 
estimates  of  subbasin  lag  are  small. 

The  calibrated  HEC-1  model  (using  the  two  land  use  estimates  of  Snyder's 
lag)  and  synthetic  rainfall  produced  the  discharge  values  plotted  in  Figure 
17  for  selected  stations  in  the  Upper  Spring  Creek  drainage.  Differences 
between  such  discharge  frequency  curves  can  be  interpreted  as  a  measure  of 
the  hydrologic  significance  of  LANDSAT' s  misclassification  of  land  use. 
Considering  the  uncertainty  involved  in  estimating  a  frequency  curve  (even 
from  observed  data),  the  difference  between  LANDSAT  and  conventional  curves 
is  insignificant. 

Pennypack  Creek 

The  SCS  curve  number  method  was  used  to  model  the  Pennypack  Creek  basin. 
Curve  numbers  for  conventional  21  categories,  IANDSAT  9  categories,  and 
conventional  aggregated  to  9  categories  are  given  in  Tables  29,  30,  and  31, 
respectively.  For  each  of  Pennypacks  sixty-five  subbasins  HYDPAR  calculated 
subbasin  average  curve  number  and  subbasin  lag  (according  to  the  equation  of 
Figure  15).  Also,  subbasin  average  curve  number  and  lag  were  calculated  for 
(a)  all  land  use  categories  assigned  the  industrial  category  curve  nvsnbers, 
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Figure  17 

Rowlett  Creek 

Upper  Spring  Creek  Subbasin 

Annual  Peak  Discharge  Frequency 
(selected  stations) 


Conventional  hand  Use 
l.ANDSAT  Land  Use 


and  (b)  all  land  use  categories  assigned  the  natural  vegetation  curve 
numbers.  Parameters  estimated  in  these  two  cases,  and  the  discharge 
frequency  curves  derived  from  them,  demonstrate  the  possible  extremes  (in 
terms  of  runoff)  that  could  have  been  generated  from  the  model.  Estimates 
for  the  five  conditions  described  above  are  listed  in  Table  32.  Subbasin 
curve  number  and  lag  for  conventional,  LANDSAT,  and  aggregated  conventional 
are  seen  to  have  nearly  identical  values. 

Table  29 
Pennypack  Creek 

SCS  Curve  Numbers  for  Conventional  Land  Use 


LAND  USE 

HYDROLOGIC 

SOIL  GROUP 

CATEGORY 

TITLE 

A 

B 

C 

D 

1 

Residential  (Single)  -  High  Value 

61. 

76. 

83. 

87. 

2 

Residential  (Single)  -  Low  Value 

61. 

76. 

83. 

87. 

3 

Residential  (Twins)  -  High  Value 

65. 

78. 

85. 

88. 

4 

Residential  (Twins)  -  Median  Value 

65. 

78. 

85. 

88. 

5 

Residential  (Twins)  -  Low  Value 

65. 

78. 

85. 

88. 

6 

Residential  (Apts)  -  High  Value 

81. 

88. 

91. 

93. 

7 

Residential  (Apts)  -  Low  Value 

81. 

88. 

91. 

93. 

8 

Light  Industry 

81. 

88. 

91. 

93. 

9 

Heavy  Industry 

81. 

88. 

91. 

93. 

10 

Transportation 

92. 

94. 

95. 

96. 

11 

Communication  and  Utilities 

74. 

85. 

88. 

91. 

12 

Commercial  -  High  Value 

89. 

92. 

94. 

95. 

13 

Commercial  -  Low  Value 

89. 

92. 

94. 

95. 

14 

Community  Services  -  High  Value 

60. 

74. 

82. 

86. 

15 

Community  Services  -  Low  Value 

60. 

74. 

82. 

86. 

16 

Military 

0. 

0. 

0. 

0. 

17 

Recreation  and  Cultural 

39. 

61. 

74. 

80. 

18 

Agriculture 

62. 

75. 

83. 

84. 

19 

Mining 

79. 

82. 

84. 

85. 

20 

Forest  and  Undeveloped  Land 

40. 

63. 

75. 

81. 

21 

Water  Areas 

100. 

100. 

100. 

100. 
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Table  30 


Pennypack 

Creek 

SCS  Curve  Numbers  for 

LANDSAT  Land  Use 

LAND  USE 

HYDROLOGIC 

SOIL  i 

GROUP 

CATEGORY 

TITLE 

A 

B 

C 

D 

1 

Natural  Veg 

40. 

63. 

75. 

81. 

2 

Pasture 

62. 

75. 

83. 

84. 

3 

Agricultural 

62. 

75. 

83. 

84. 

4 

Developed  Open  Space 

39. 

61. 

74. 

80. 

5 

Residential 

65. 

78. 

85. 

88. 

6 

Light  Industry 

81. 

88. 

91. 

93. 

7 

Intermediate  Industry 

81. 

88. 

91. 

93. 

8 

Heavy  Industry 

81. 

88. 

91. 

93. 

9 

Water 

100. 

100. 

100. 

100. 

Table  31 
Pennypack  Creek 

SCS  Curve  Numbers  for  Aggergated  Conventional  Land  Use 


LAND  USE  HYDROLOGIC  SOIL  GROUP 


CATEGORY 

TITLE 

A 

B 

C 

D 

1 

Residential  (Single)  -  High  Value 

65. 

78. 

85. 

88. 

2 

Residential  (Single)  -  Low  Value 

65. 

78. 

85. 

88. 

3 

Residential  (Twins)  -  High  Value 

65. 

78. 

85. 

88. 

4 

Residential  (Twins)  -  Median  Value 

65. 

78. 

85. 

88. 

5 

Residential  (Twins)  -  Low  Value 

65. 

78. 

85. 

88. 

6 

Residential  (Apts)  -  High  Value 

65. 

78 

85. 

88. 

7 

Residential  (Apts)  -  Low  Value 

65. 

78. 

85. 

88. 

8 

Light  Industry 

81. 

88. 

91. 

93. 

9 

Heavy  Industry 

92. 

94. 

95. 

96. 

10 

Transportation 

92. 

94. 

95. 

96. 

11 

Communication  and  Utilities 

81. 

88. 

91. 

93. 

12 

Commercial  -  High  Value 

84. 

90. 

92. 

94. 

13 

Commercial  -  Low  Value 

84. 

90. 

92. 

94. 

14 

Community  Services  -  High  Value 

65. 

78. 

85. 

88. 

15 

Community  Services  -  Low  Value 

65. 

78. 

85. 

88. 

16 

Military 

0. 

0. 

0. 

0. 

17 

Recreation  and  Cultural 

39. 

61. 

74. 

80. 

18 

Agriculture 

62. 

75. 

83. 

84. 

19 

Mining 

79. 

82. 

84. 

85. 

20 

Forest  and  Undeveloped  Land 

40. 

63. 

75. 

81 . 

21 

Water  Areas 

100. 

100. 

100. 

100. 
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32  (Continued) 


The  calibrated  HEC-1  model  of  Pennypack  Creek  simulated  the  basin's 
discharge  frequency  behavior  for  conventional,  LANDSAT,  all  industrial,  and 
all  natural  vegetation  conditions.  Resulting  frequency  curves  for  selected 
subbasins  and  the  entire  drainage  area  are  shown  as  Figures  18-21.  It  is 
clear  from  these  figures,  especially  with  reference  to  what  could  have  been 
(i.e.,  all  industrial  and  all  natural  vegetation  conditions),  that 
differences  between  LANDSAT  and  conventionally  derived  frequency  curves  are 
not  significant. 

Castro  Valley 


An  HEC-1  model  of  Castro  Valley  divided  the  watershed  into  the  four 
subbasins  shown  in  Figure  22.  Similar  to  the  Pennypack  Creek  analysis,  SCS 
curve  numbers  were  assigned  to  conventional  (Table  33)  and  LANDSAT  (Table 
34)  land  use  categories,  HYDPAR  was  used  to  estimate  subbasin  average  curve 
number  and  lag  (Table  35),  and  runoff  for  selected  recurrence  intervals  was 
simulated  (Table  36)  by  the  calibrated  HEC-1  model.  As  before,  discharge 
frequency  values  derived  from  conventional  and  LANDSAT  land  use  were  nearly 
identical . 


Table  33 


Castro  Valley 

SCS  Curve  Numbers 

for  Conventional 

Land 

Use 

LAND  USE 

HYDROLOGIC 

SOIL  GROUP 

CATEGORY 

TITLE 

A 

B 

C 

D 

1 

Natural  Vegetation 

39. 

61. 

74. 

80. 

2 

Developed  Open  Space 

49. 

69. 

79. 

84. 

3 

Lo  Den  Residential 

57. 

72. 

81. 

86. 

4 

Med  Den  Residential 

61. 

75. 

83. 

87. 

5 

Hi  Den  Residential 

77. 

85. 

90. 

92. 

6 

Conmercial 

89. 

92. 

94. 

95. 

7 

Institutional 

69. 

80. 

86. 

89. 

8 

Water 

100. 

100. 

100. 

100. 

57 


1000 


DISCHARGE  (cf«) 


59 


1000 


DISCHARGE  (1000  eft) 


1000 


DISCHARGE  (10 00  ef»i 


95  90 


EXCEEDANCE  FREQUENCY,  in  p*rc*nt 
80  70  60  50  40  30  20 


10 


5 


2  l  .5  .2  .1 
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61 


500 


Table  34 
Castro  Valley 

SCS  Curve  Numbers  for  LANDSAT  Land  Use 


LAND  USE 

HYDROLOGIC 

SOIL  GROUP 

CATEGORY 

TITLE 

A 

B 

C 

D 

1 

Natural  Vegetation 

39. 

61. 

74. 

80 

2 

Developed  Open  Space 

49. 

69. 

79. 

84 

3 

Residential 

61. 

75. 

83. 

87 

4 

Commercial/ Industrial 

85. 

90. 

93. 

94 

5 

Water 

100. 

100. 

100. 

100 

Table  35 
Castro  Valley 

Comparison  of  Hydrologic  Model  Parameters 


SUB- 

BAS  IN 

CURVE  NUMBER 

LAG 

CONVENTIONAL 

LANDSAT 

CONVENTIONAL 

LANDSAT 

1 

85.9 

86.3 

1.05 

1.03 

2 

84.4 

84.6 

.51 

.51 

3 

87.9 

87.7 

.48 

.49 

4 

76.1 

76.8 

.42 

.41 

Table  36 
Castro  Valley 

Discharge  Frequency  Comparison 


SUBBASINS 

DA 

(sq.  mi.) 

RECURRANCE  INTERVAL 

2-YR 

10- YR 

25-YR 

100- YR 

2 

2.20 

504 

1,019 

1,320 

1,783  Conv. 

513 

1,032 

1,334 

1 , 798  LANDSAT 

2,3 

3.11 

595 

1,166 

1,477 

1,948 

603 

1,179 

1,490 

1,966 

4 

1.01 

106 

295 

418 

619 

118 

317 

446 

652 

M 

2.45 

329 

700 

921 

1,270 

348 

730 

956 

1,311 

1,2, 3, 4 

5.56 

905 

1,829 

2,353 

3,169  Conv. 

935 

1,876 

2,409 

3,237  LANDSAT 

SUMMARY  AND  CONCLUSIONS 


HEC'8  participation  in  the  NASA  Water  Management  and  Control  ASVT  has 
included  several  tasks,  all  of  which  have  been  completed.  An  operational 
procedure  for  determining  land  use  from  LANDSAT  imagery  has  been  applied  to 
five  watersheds:  Crow  Creek,  Walnut  Creek,  Rowlett  Creek,  Pennypack  Creek, 
and  Castro  Valley  (UCD  staff  performing  the  work  on  the  latter  three 
watersheds).  Detailed  grid  cell  comparisons  between  conventional  and  LANDSAT 
derived  land  use  were  made  for  all  except  Crow  Creek.  The  alternative  land 
use  classifications  were  also  compared  hydrologically  for  all  except  Crow 
and  Walnut  Creeks.  Land  use  for  the  San  Francisco  Bay  area,  including  Castro 
Valley,  and  Walnut  Creek  was  determined  commercially  by  Bendix  Corporation 
and  Battelle  Pacific  Northwest  Labs,  respectively.  Grid  cell  comparison  of 
Walnut  Creek  was  repeated  for  the  Battelle  classification.  Cost  and  time 
records  for  the  LANDSAT  analysis  of  Crow  and  Walnut  Creeks  were  maintained 
and  provided  to  NASA's  cost-effectiveness  study  contractor. 

Based  on  this  work  the  following  conclusions  can  be  made: 

(1)  At  the  grid  cell  level  LANDSAT  land  use  was  in  error  approximately 
1/3  of  the  time. 

(2)  By  aggregating  land  use  over  the  entire  watershed,  LANDSAT' s 
average  misclassif ication  of  land  use  reduces  to  2  to  8%  for  the  major  land 
use  categories. 

(3)  The  commercial  (Battelle)  classification  was  less  accurate  at  the 
grid  cell  level  than  the  UCD  procedure  classification;  errors  at  the 
watershed  level  were  nearly  the  same  for  both. 

(4)  Evaluated  in  terms  of  the  difference  in  discharge  frequency  curves 
derived  using  the  same  hydrologic  model  but  different  land  use  (LANDSAT  and 
conventional),  the  LANDSAT  derived  land  use  was  found  to  be  completely 
adequate.  The  number  and  type  of  land  use  categories  derived  from  LANDSAT 
data  were  sufficient  to  be  able  to  apply  two  standard  hydrologic  modeling 
techniques:  Snyder's  unit  hydrograph  with  percent  imperviousness  and  the  SCS 
curve  number  method. 

(5)  The  UCD  procedure  works.  It  is  a  complete,  self-contained  package 
of  computer  programs  and  manual  operations  that  permit  a  user  to  identify 
land  use  from  LANDSAT  digital  data  without  requiring  the  use  of  expensive 
interactive  image  processing  equipment. 

(6)  LANDSAT  land  use  can  be  directly  incorporated  into  a  grid  cell  data 
bank,  thus  providing  an  automated  environment  for  applying  the  LANDSAT 
classification  in  routine  hydrologic  investigations. 
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Appendix  B 
WHAT  IS  LANDSAT?! 

LANDSAT  is  the  name  of  a  series  of  NASA  satellites  intended  to  provide  earth 
resources  information  to  the  general  scientific  and  environmental  planning 
community.  Each  Landsat  flies  in  a  nearly  polar  orbit  570  miles  (920  km) 
above  the  earth's  surface,  circling  the  earth  every  103  minutes  or 
approximately  14  times  per  day.  The  spacecraft  views  the  earth  at  the  same 
local  time,  roughly  9:30  a.m.  at  the  equator,  on  each  pass.  Repeat  coverage 
from  a  single  satellite  occurs  every  18  days.  LANDSAT-2  and  -3  orbits  are 
separated  by  nine  days  providing  the  potential  of  9-day  repetitive  coverage 
of  the  earth's  surface  from  space. 

The  multispectral  scanners  (MSS),  LANDSAT 's  primary  onboard  sensor  system, 
measure  the  earth's  reflected  sunlight  in  four  spectral  bands: 

band  4  (green)  0.5-0. 6  pm,  emphasizes  movement  of  sediment-laden  water 
and  delineates  areas  of  shallow  water,  such  as  shoals,  reefs,  etc; 

-band  5  (red)  0.6-0. 7  pm,  emphasizes  cultural  features,  such  as 
metropolitan  areas; 

-band  6  (near-infrared)  0.7-0. 8  >un,  emphasizes  vegetation,  the  boundary 
between  land  and  water,  and  landforms; 

band  7  (near-infrared)  0.8-1. 1  pm,  provides  the  best  penetration  of 
atmospheric  haze  and  also  emphasizes  vegetation,  the  boundary  between 
land  and  water,  and  landforms. 

The  latest  satellite,  LANDSAT-3,  has  an  additional  band  8  that  senses 
emitted  (not  reflected)  thermal  infrared  radiation  in  the  range  10.4-12.6 
pm.  Band  8  measures  temperatures  between  -13°C  and  67°C  and 
discriminates  relative  temperature  differences  as  small  as  1.5°C.  This 
band  is  expected  to  be  useful  for  monitoring  volcanic  action,  power  plant 
discharges  of  hot  water,  and  geysers. 

The  light  reflectance  data  obtained  by  the  MSS  sensing  system  are  first 
converted  to  electrical  signals,  which  vary  in  proportion  to  the  intensity 
measured  for  each  band.  These  analog  signals  are  then  converted  into  a 
digital  form  and  transmitted  to  receiving  stations  on  Earth. 

The  digital  data  may  be  either  reformatted  into  computer  compatible  tapes 
(CCTs)  or  reconverted  into  sets  of  black  and  white  photo  images.  A  LANDSAT 
scene  is  an  imaged  ground  area  normally  185  km  (115  mi)  crosstrack  by  170  km 
in  the  spacecraft  direction.  A  LANDSAT  scene  stored  on  a  CCT  contains  2,340 
scan  lines  with  3,240  columns  (samples)  each,  an  aggregate  of  over  7.5 
million  data  elements,  called  pixels.  Each  pixel  has  a  numerical  reflectance 

1  Most  of  this  section  was  extracted  directly  from  the  following 
references:  Holkenbrink  (1978),  LANDSAT  Data  Users  NOTES,  Short  (1976),  Todd 
(1978)  and  Watkins  (1978). 
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value  in  each  of  the  four  bands.  The  nominal  area  of  a  pixel  is  0.45  ha  (57 
x  79  m)  or  1 . 11  acres. 

LANDSAT  photographic  imagery  is  derived  from  Che  digital  data.  Relative 
reflectance  values  are  assigned  gray  levels  for  the  production  of  bands  4, 

5,  6  and  7  scene  images.  Color  images  are  made  from  combinations  of 
individual  black  and  white  images  by  projecting  each  given  band  through  a 
particular  filter.  In  this  rendition,  called  a  false  color  image,  growing 
vegetation  will  appear  in  various  shades  of  red,  rocks  and  soils  will 
normally  show  colors  ranging  from  bluish  through  yellows  and  browns,  water 
will  stand  out  as  blue  to  black  depending  on  depth  and  amount  of  suspended 
sediment,  and  cultural  features  (towns  and  roads)  will  usually  be  recognized 
by  bluish-black  tones  arranged  in  characteristic  patterns.  Photographic 
imagery  (paper  prints)  are  commonly  available  in  scales  ranging  from  1:10^ 
to  1:250,000. 


Appendix  C 


CLUSTER  ANALYSISl 

The  aim  of  any  clustering  algorithm  is  to  partition  a  given  set  of 
multivariate  data  points,  with  little  or  no  knowledge  about  the  actual 
distribution  of  the  data,  into  disjoint  sets  ("clusters")  of  similar  data 
points.  ISOCLAS,  the  clustering  program  utilized  in  the  UCD  Procedure, 
attempts  to  partition  the  four-dimensional  data  space2  into  clusters  of 
points  having  spectral  reflectance  values  that  are  (1)  similar  to  members  of 
the  same  cluster,  and  (2)  dissimilar  to  the  points  of  other  clusters.  Before 
describing  the  actual  algorithm,  some  general  terms  must  be  explained. 

The  center  of  the  Ith  cluster  is  represented  by  the  mean, 
of  all  points  belonging  to  that  cluster. 

Distance,  d,  from  a  data  point  Xk=  (Xkl,  Xk2,  xk3,  xk4)  to 
the  Ith  cluster  is  defined  as 

-Standard  deviation,  Or(l),  of  the  cluster  is  equivalent  to  the 
standard  deviation  of  all  points  belonging  to  the  cluster. 

-Mean  and  standard  deviation  of  a  cluster  are  four-dimensional  arrays, 
whereas  distance  from  point  to  cluster  is  a  single  number. 

The  clustering  process  starts  with  all  the  data  assigned  to  one  cluster.  The 
cluster  is  split  into  two  new  clusters  along  the  dimension  with  the  largest 
standard  deviation.  Cluster  centers  are  estimated  for  the  two  new  clusters 
and  distance  from  each  data  point  to  these  centers  is  determined.  Points  are 
reassigned  to  the  nearest  cluster.  Means,  standard  deviations  and  point 
counts  of  the  newly  developing  clusters  are  calculated.  The  splitting 
process  continues  until  at  least  80Z  of  the  clusters  are  not  eligible  for 
splitting;  i.e.,  when  max  (<r(I))  is  less  than  a  specified  threshold  value 
of  STDMAX. 

The  program  then  alternates  between  combine,  split,  and  delete  iterations. 
Two  clusters  are  combined  if  the  distance  between  their  centers  is  less  than 
a  threshold  parameter  DLMIN.  A  cluster  is  deleted  whenever  it  has  fewer  than 
NMIN  members.  The  classification  terminates  when  at  least  one  of  the 
following  conditions  is  satisfied:  (1)  splitting  and  combining  are  no  longer 
permitted  because  of  their  respective  thresholds;  (2)  two  consecutive 
iterations  are  performed  with  N  percent  or  fewer  of  the  data  points  involved 
in  a  cluster  reassignment;  or  (3)  the  maximum  number  of  cluster  iterations, 
ISTOP,  is  reached. 

1  Parts  of  this  section  were  extracted  from  Minter  (1977). 

2  One  dimension  for  each  of  the  multispectral  scanner's  (MSS)  four  bands. 


